Performance Oriented Schema Matching
نویسندگان
چکیده
Semantic matching of schemas in heterogeneous data sharing systems is time consuming and error prone. Existing mapping tools employ semi-automatic techniques for mapping two schemas at a time. In a large-scale scenario, where data sharing involves a large number of data sources, such techniques are not suitable. We present a new robust mapping method which creates a mediated schema tree from a large set of input XML schema trees and defines mappings from the contributing schema to the mediated schema. The result is an almost automatic technique giving good performance with approximate semantic match quality. Our method uses node ranks calculated by pre-order traversal. It combines tree mining with semantic label clustering which minimizes the target search space and improves performance, thus making the algorithm suitable for large scale data sharing. We report on experiments with up to 80 schemas containing 83,770 nodes, with our prototype implementation taking 587 seconds to match and merge them to create a mediated schema and to return mappings from input schemas to the mediated schema.
منابع مشابه
An Improved Semantic Schema Matching Approach
Schema matching is a critical step in many applications, such as data warehouse loading, Online Analytical Process (OLAP), Data mining, semantic web [2] and schema integration. This task is defined for finding the semantic correspondences between elements of two schemas. Recently, schema matching has found considerable interest in both research and practice. In this paper, we present a new impr...
متن کاملAn Indexing Structure for Automatic Schema Matching
Querying semantically related data sources depends on the ability to map between their schemas. Unfortunately, in most cases matching between schema is still largely performed manually or semi-automatically. Consequently, the issue of finding semantic mappings became the principal bottleneck in the deployment of the mediation systems in large scale where the number of ontologies and or schemata...
متن کاملThe Effectiveness of Integrated Schema Oriented Therapy and Young’s Schema Therapy on Perception of Exclusion among individuals with Borderline Personality Characteristics
Background& Aims: Personality pathological symptoms are the ones that require the attention of psychological therapists. Borderline personality characteristics due to its significant prevalence, as a personality trait, require the attention of therapists. Accordingly, the aim of this study was to determine the effectiveness of integrated schema oriented therapy and schema therapy on perception ...
متن کاملPORSCHE: Performance ORiented SCHEma Matching
Semantic matching of schemas in heterogeneous data sharing systems is time consuming and error prone. Existing mapping tools employ semi-automatic techniques for mapping two schemas at a time. In a large-scale scenario, where data sharing involves a large number of data sources, such techniques are not suitable. In this paper we present a method, which creates a mediated schema tree from a larg...
متن کاملComparison of the Effectiveness of Schema Therapy and Integrated Schema Oriented Therapy on Components of Impulsivity in People with Borderline Personality Characteristic
Introduction: Impulsivity is one of the problems which Individuals with borderline personality characteristics suffered from them. According to this point, the aim of this study was to comparison of the effectiveness of schema therapy and integrated schema oriented therapy on impulsivity and their components among individuals with borderline personality characteristics. Methods: The research me...
متن کامل